| HDFS | 3.2.1 | Apache Hadoop Distributed File System |
| YARN | 3.2.1 | Apache Hadoop NextGen MapReduce (YARN) |
| MapReduce2 | 3.2.1 | Apache Hadoop NextGen MapReduce (YARN) |
| Hive | 3.1.1 | Data warehouse system for ad-hoc queries & analysis of large datasets and table & storage management service |
| HBase | 2.2.2 | Non-relational distributed database and centralized service for configuration management & synchronization |
| ZooKeeper | 3.5.6 | Centralized service which provides highly reliable distributed coordination |
| Ambari Metrics | 0.1.0 | A system for metrics collection that provides storage and retrieval capability for metrics collected from the cluster |
| Ranger | 2.0.0 | Comprehensive security for Hadoop |
| Flink | 1.13.5 | Apache Flink is a streaming dataflow engine that provides data distribution, communication, and fault tolerance for distributed computations over data streams. |
| Kerberos | 1.10.3-30 | A computer network authentication protocol which works on the basis of 'tickets' to allow nodes communicating over a non-secure network to prove their identity to one another in a secure manner. |
| OCEANBASE | 3.1.2 | An opensource distributed relational database |
| Clickhouse | 19.3.6 | open source distributed column-oriented DBMS. |
| Dlink | 0.6.0 | Apache Flink is a streaming dataflow engine that provides data distribution, communication, and fault tolerance for distributed computations over data streams. |
| Dolphin Scheduler | 1.3.8 | 分布式易扩展的可视化DAG工作流任务调度系统 |
| Apache Doris | 0.11.0 | Apache Doris |
| Elasticsearch | 7.2.0 | Indexing and Search |
| Grafana | 5.2.4 | Dashboard |
| Impala | 3.2.0 | an open source, analytic MPP database for Apache Hadoop that provides the fastest time-to-insight |
| Kudu | 1.13.0 | A new addition to the open source Apache Hadoop ecosystem, Apache Kudu completes Hadoop's storage layer to enable fast analytics on fast data. |
| Kyuubi | 1.5.0 | Apache Kyuubi, a distributed and multi-tenant gateway to provide serverless SQL on lakehouses |
| Presto | 0.303 | Presto is an open source distributed SQL query engine for running interactive analytic queries against data sources of all sizes ranging from gigabytes to petabytes. |
| Redis | 5.0 | Redis is an in-memory data structure store, used as database, cache and message broker |
| Spark2 | 2.4.8 | Apache Spark is a fast and general engine for large-scale data processing. |
| Sqoop | 1.4.7 | Tool for transferring bulk data between Apache Hadoop and structured data stores such as relational databases |
| Tez | 0.9.2 | Tez is the next generation Hadoop Query Processing framework written on top of YARN |
| ZEPPELIN | 0.8.0 | A web-based notebook that enables interactive data analytics. It enables you to make beautiful data-driven, interactive and collaborative documents with SQL, Scala and more. |